These instructions explain how to reproduce, in Stata 15, the results presented in the paper "REVEALING "MAFIA INC."? FINANCIAL CRISIS, ORGANIZED CRIME AND THE BIRTH OF NEW ENTERPRISES".

1 - Unzip the file "Revealing_Mafia_Inc"

2 - To construct the database employed in the analysis, open the do file "Building_sample" in the folder "DO_FILES", change the working directories set at the beginning of the do file, and run it.

3 - To reproduce the main analysis of the paper (i.e., Figures 1-7 and Tables 1-3), open the do file "Main_analysis" in the folder "DO_FILES", change the working directories set at the beginning of the do file, and run it. The Stata packages that need to be installed before running the code are listed at the beginning of the do file.

4 - To reproduce the analysis presented in the Appendix of the paper (i.e., Figures A2-A11 and Tables A1-A6), open the do file "Appendix" in the folder "DO_FILES", change the working directories set at the beginning of the do file, and run it. The Stata packages that need to be installed before running the code are listed at the beginning of the do file.



The following list includes a description of all variables employed in the paper with their source(s):

new_std_ln: number of new enterprises per 100,000 inhabitants set up each year at the provincial level. The data are collected by the Italian National Institute of Statistics (ISTAT), which makes publicly available only the number of new enterprises at the provincial level without additional information such as enterprise name and size.

construction_std_ln: number of new enterprises in the construction sector per 100,000 inhabitants set up each year at the provincial level. The construction industry is labeled as F in the Ateco 2002 and the Ateco 2007 classifications. The data are collected by the Italian Chamber of Commerce.

ric_std_ln: number of new enterprises in the sector of professional, scientific, and technical activities (per 100,000 inhabitants) set up each year at the provincial level. Professional, scientific, and technical activities are labeled as M in the Ateco 2002 and the Ateco 2007 classifications. The data are collected by the Italian Chamber of Commerce.

limited_std_ln: number of new enterprises registered as a limited company per 100,000 inhabitants set up each year at the provincial level. Limited companies include the so-called "Società a Responsabilità Limitata". The data are collected by the Italian Chamber of Commerce.

closed_std_ln: number of enterprises per 100,000 inhabitants closed each year at the provincial level. The data are collected by the Italian National Institute of Statistics (ISTAT), which makes publicly available only the number of closed enterprises at the provincial level without additional information such as enterprise name and size.

registered_std_ln: stock of registered enterprises per 100,000 inhabitants each year at the provincial level. The data are collected by the Italian National Institute of Statistics (ISTAT), which makes publicly available only the number of registered enterprises at the provincial level without additional information such as enterprise name and size.

big_banks: percentage of big banks at the year-province level. The Bank of Italy defines big banks as those with a total value of traded funds greater than EUR 26 billion. The data are collected by the Bank of Italy. 

self_emp: self-employed people as a percentage of the total number of employed people in each province. The data are collected by the Italian National Institute of Statistics (ISTAT).

tourism: index of the capacity of a given province to attract tourism-type consumption in a specific year in terms of days spent by tourists within a province per inhabitant. The data are collected by the Italian National Institute of Statistics (ISTAT). 

wastes_xc: tons of waste produced per capita at the province-year level. The data are collected by the research center Istituto Superiore per la Protezione e Ricerca Ambientale (ISPRA).

trial: average length in days of a trial for bankruptcy at the province-year level. Data are available for 2000-2007. For subsequent years, we impute the average of the provincial values over the eight preceding years (e.g., for 2008 we compute the average over 2000-2007). The data are collected by the Italian National Institute of Statistics (ISTAT).
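
The imputation rule for trial can be read as a rolling average. The sketch below is only an illustration in Python, not the code used in the do files. It assumes that averages for later years (2009 onwards) draw on previously imputed values; the function name and the sample figures are hypothetical.

```python
def impute_trial_length(observed, last_year, window=8):
    """Extend one province's series of average trial lengths (in days).

    observed: dict mapping year -> average trial length, with no gaps.
    Each missing year is filled with the mean of the `window` preceding
    years. Assumption: earlier imputed values enter later averages.
    """
    series = dict(observed)  # copy so the caller's data is left untouched
    first_missing = max(observed) + 1
    for year in range(first_missing, last_year + 1):
        preceding = [series[y] for y in range(year - window, year)]
        series[year] = sum(preceding) / window
    return series


# Hypothetical province: 100 days in 2000, rising by 100 each year to 2007.
observed = {2000 + i: 100 * (i + 1) for i in range(8)}
filled = impute_trial_length(observed, last_year=2010)
```

Under this reading, the value for 2008 is the mean of 2000-2007, and the value for 2009 is the mean of 2001-2008, where the 2008 figure is itself imputed.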

pop_urb: percentage of urban population over the total provincial population at the province-year level. The data are collected by the Italian National Institute of Statistics (ISTAT).

newspapers_ln: total number of newspapers sold (per 1,000 inhabitants) at the year-province level. The data are collected by the Italian National Press Agency (Accertamenti Diffusione Stampa - ADS).

blood: number of blood bags per 100 inhabitants at the region-year level. The data are collected by the Italian National Agency for Blood Donation (Agenzia Volontari Italiani del Sangue - AVIS).

gdp: gross domestic product at current market prices at the province-year level. The data are collected by Eurostat.

un_rate: number of unemployed persons as a percentage of the labor force at the province-year level. The labor force includes unemployed individuals plus those in paid or self-employment. The data are collected by the Italian National Institute of Statistics (ISTAT).
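
The definition of un_rate amounts to a simple ratio. The following Python sketch only illustrates that arithmetic; the function name and the figures are hypothetical, and ISTAT publishes the rate directly, so nothing here is taken from the do files.

```python
def unemployment_rate(unemployed, employed):
    """Return unemployed persons as a percentage of the labor force.

    The labor force is the sum of the unemployed and those in paid or
    self-employment, as in the variable description.
    """
    labor_force = unemployed + employed
    if labor_force <= 0:
        raise ValueError("labor force must be positive")
    return 100.0 * unemployed / labor_force


# Hypothetical province-year: 12,000 unemployed and 188,000 employed.
rate = unemployment_rate(unemployed=12_000, employed=188_000)
```

With these made-up figures the labor force is 200,000 and the rate is 6 percent.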

tot_exp_r: total exports as a share of GDP at the province-year level. Data on exports are collected by the Italian National Institute of Statistics (ISTAT).

proc_std_ln: total amount of public procurement at the province-year level per 100,000 inhabitants. The data contain the universe of public procurement contracts for projects above EUR 40,000. The data are elaborated by De Carolis et al. (2019) from records of the Italian National Anticorruption Authority (ANAC). 

fund_eu: total amount of funds at the province-year level made available under the EU multi-year budget for 2007-2013. The data are collected by OpenCoesione.

co: total number of people resettled from Sicily, Campania, and Calabria across Italian provinces during 1961-1974 under the confino law. For each of the provinces outside the three regions where the mafia originated, we compute the total number of mafia members hosted following the application of the 1956 law. The data used to construct the variable are taken from the 1976 annual report of the Antimafia Commission of the Parliament.


